The Ising Model for Changes in Word Ordering Rule in Natural Languages

Authors

  • Yoshiaki Itoh
  • Sumie Ueda
Abstract

The order of ‘noun and adposition’ is the key parameter of word ordering rules in the world’s languages. Seven parameters, ‘adverb and verb’ and others, depend strongly on ‘noun and adposition’. Japanese, Korean, Tamil and several other languages seem to have a stable structure of word ordering rules, as do Thai and other languages whose word ordering rules are the opposite of those of Japanese. It seems that each language in the world fluctuates between these two structures, like the Ising model on a finite lattice.

Model of changes of word ordering rules and Tsunoda’s table

Natural languages have been attractive objects of various interdisciplinary studies. The problem of how the universals of languages evolve is drawing the attention of physicists [1]. The importance of adpositions (prepositions and postpositions) as a parameter is recognized in word order typology [4, 5]. In previous studies [2, 6] using the word order data of Tsunoda (Table 1) [2], in which Japanese is taken as the standard of comparison, 130 languages are classified using 19 word order parameters. The result is that the 130 languages divide neatly, with one or two exceptions, into two groups: (a) prepositional languages and (b) other languages, i.e. postpositional languages and adpositionless languages. Adpositionless languages behave like postpositional languages in terms of the other word order parameters. The second most important parameter of word order is ‘numeral and noun’ [7]. When a statistical method for categorical data [8] is applied, the clustering of the 130 languages is well explained by the two word ordering parameters ‘noun and adposition’ and ‘numeral and noun’. Greenberg [4] uses the position of adpositions in seven of the forty-five universals he proposes (pp. 110-12). For example, in languages with prepositions the genitive almost always follows the governing noun, while in languages with postpositions it almost always precedes it.
Tsunoda’s data show that parameters 1 (S, O and V), 3 (genitive and noun), 7 (relative clause and noun), 8 (proper noun and common noun), 9 (comparison of superiority), 10 (main verb and auxiliary verb) and 19 (purpose clause and main clause) depend strongly on parameter 2 (noun and adposition) [2, 6, 7]. For example, our study of the data gives statistical laws on the cooccurrence of three parameters. (i) If a language has prepositions and is a VO language, then the genitive tends to follow the noun.
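
The comparison with a finite Ising model can be made concrete with a small simulation. The sketch below is only an illustration under stated assumptions, not the authors' model: the eight word order parameters (‘noun and adposition’ plus the seven that depend on it) are treated as spins of value +1 or -1, each dependent parameter is coupled only to the adposition spin (a star graph), and the dynamics are single-spin Metropolis updates. The coupling strength, the temperature, the update rule and all function names are hypothetical choices made for this example.

```python
import math
import random

# Assumption: one 'noun and adposition' spin (index 0) plus the seven
# parameters said to depend on it (indices 1..7).
N_DEPENDENT = 7


def energy_change(spins, i, coupling):
    """Energy change from flipping spin i on a star graph centred on spin 0.

    Assumed energy: E = -coupling * sum_{j>=1} spins[0] * spins[j].
    """
    if i == 0:
        neighbour_sum = sum(spins[1:])  # the centre interacts with every leaf
    else:
        neighbour_sum = spins[0]        # a leaf interacts only with the centre
    return 2.0 * coupling * spins[i] * neighbour_sum


def simulate(steps, coupling=1.0, temperature=1.0, seed=0):
    """Run single-spin Metropolis updates; return the mean spin after each step."""
    rng = random.Random(seed)
    spins = [1] * (N_DEPENDENT + 1)  # start in one fully consistent ordering
    history = []
    for _ in range(steps):
        i = rng.randrange(len(spins))
        delta = energy_change(spins, i, coupling)
        # Metropolis rule: always accept downhill moves, accept uphill moves
        # with probability exp(-delta / temperature).
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            spins[i] = -spins[i]
        history.append(sum(spins) / len(spins))
    return history


if __name__ == "__main__":
    trace = simulate(steps=20000, coupling=1.0, temperature=1.5)
    print("fraction of steps with positive mean spin:",
          sum(1 for m in trace if m > 0) / len(trace))
```

In this toy setting a mean spin near +1 stands for one consistent set of word ordering rules and a mean spin near -1 for the opposite set. Because the system is finite, it can cross from one to the other at nonzero temperature, which is the qualitative behaviour the abstract alludes to.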

Publication date: 2001